Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 244 |
| Missing cells | 678 |
| Missing cells (%) | 6.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 357.8 KiB |
| Average record size in memory | 1.5 KiB |
Variable types
| Categorical | 27 |
|---|---|
| DateTime | 3 |
| Numeric | 8 |
| Unsupported | 2 |
TP_NOT has constant value "2" | Constant |
ID_AGRAVO has constant value "B54" | Constant |
NU_ANO has constant value "2015" | Constant |
ID_REGIONA has constant value "" | Constant |
SG_UF has constant value "33" | Constant |
ID_RG_RESI has constant value "" | Constant |
ID_PAIS has constant value "1" | Constant |
ID_OCUPA_N has a high cardinality: 98 distinct values | High cardinality |
DEXAME has a high cardinality: 128 distinct values | High cardinality |
DTRATA has a high cardinality: 51 distinct values | High cardinality |
SEM_NOT is highly correlated with SEM_PRI | High correlation |
SG_UF_NOT is highly correlated with ID_MUNICIP | High correlation |
ID_MUNICIP is highly correlated with SG_UF_NOT | High correlation |
SEM_PRI is highly correlated with SEM_NOT | High correlation |
SEM_NOT is highly correlated with SEM_PRI | High correlation |
ID_MUNICIP is highly correlated with ID_MN_RESI | High correlation |
SEM_PRI is highly correlated with SEM_NOT | High correlation |
ID_MN_RESI is highly correlated with ID_MUNICIP | High correlation |
COPAISINF is highly correlated with PMM | High correlation |
PMM is highly correlated with COPAISINF | High correlation |
SEM_NOT is highly correlated with SEM_PRI | High correlation |
ID_MUNICIP is highly correlated with ID_MN_RESI | High correlation |
SEM_PRI is highly correlated with SEM_NOT | High correlation |
ID_MN_RESI is highly correlated with ID_MUNICIP | High correlation |
COUFINF is highly correlated with RESULT and 8 other fields | High correlation |
PMM is highly correlated with CS_RACA and 4 other fields | High correlation |
CS_RACA is highly correlated with PMM and 4 other fields | High correlation |
RESULT is highly correlated with COUFINF and 12 other fields | High correlation |
AT_SINTOMA is highly correlated with RESULT and 3 other fields | High correlation |
SG_UF_NOT is highly correlated with ID_MUNICIP | High correlation |
SEM_NOT is highly correlated with DTRATA and 1 other fields | High correlation |
DTRATA is highly correlated with COUFINF and 15 other fields | High correlation |
AT_LAMINA is highly correlated with RESULT and 5 other fields | High correlation |
ID_MUNICIP is highly correlated with SG_UF_NOT | High correlation |
ID_OCUPA_N is highly correlated with COUFINF and 7 other fields | High correlation |
COMUNINF is highly correlated with COUFINF and 9 other fields | High correlation |
CLASSI_FIN is highly correlated with COUFINF and 10 other fields | High correlation |
LOC_INF is highly correlated with COUFINF and 11 other fields | High correlation |
COPAISINF is highly correlated with PMM and 8 other fields | High correlation |
DSTRAESQUE is highly correlated with PMM and 11 other fields | High correlation |
TPAUTOCTO is highly correlated with COUFINF and 7 other fields | High correlation |
CS_GESTANT is highly correlated with CS_SEXO | High correlation |
TRA_ESQUEM is highly correlated with COUFINF and 9 other fields | High correlation |
SEM_PRI is highly correlated with SEM_NOT | High correlation |
AT_ATIVIDA is highly correlated with RESULT and 8 other fields | High correlation |
CS_SEXO is highly correlated with CS_GESTANT | High correlation |
PCRUZ is highly correlated with COUFINF and 11 other fields | High correlation |
ID_MN_RESI is highly correlated with CS_RACA and 3 other fields | High correlation |
COUFINF is highly correlated with ID_REGIONA and 10 other fields | High correlation |
ID_REGIONA is highly correlated with COUFINF and 24 other fields | High correlation |
DTRATA is highly correlated with COUFINF and 14 other fields | High correlation |
CS_ESCOL_N is highly correlated with ID_REGIONA and 6 other fields | High correlation |
ID_OCUPA_N is highly correlated with COUFINF and 7 other fields | High correlation |
DSTRAESQUE is highly correlated with ID_REGIONA and 7 other fields | High correlation |
ID_PAIS is highly correlated with COUFINF and 24 other fields | High correlation |
NU_ANO is highly correlated with COUFINF and 24 other fields | High correlation |
CS_SEXO is highly correlated with ID_REGIONA and 7 other fields | High correlation |
LOC_INF is highly correlated with COUFINF and 11 other fields | High correlation |
SG_UF is highly correlated with COUFINF and 24 other fields | High correlation |
CS_RACA is highly correlated with ID_REGIONA and 6 other fields | High correlation |
RESULT is highly correlated with ID_REGIONA and 13 other fields | High correlation |
AT_SINTOMA is highly correlated with ID_REGIONA and 10 other fields | High correlation |
SG_UF_NOT is highly correlated with ID_REGIONA and 6 other fields | High correlation |
TP_NOT is highly correlated with COUFINF and 24 other fields | High correlation |
AT_LAMINA is highly correlated with ID_REGIONA and 10 other fields | High correlation |
COMUNINF is highly correlated with COUFINF and 10 other fields | High correlation |
TPAUTOCTO is highly correlated with ID_REGIONA and 12 other fields | High correlation |
ID_AGRAVO is highly correlated with COUFINF and 24 other fields | High correlation |
CS_GESTANT is highly correlated with ID_REGIONA and 7 other fields | High correlation |
ID_RG_RESI is highly correlated with COUFINF and 24 other fields | High correlation |
TRA_ESQUEM is highly correlated with ID_REGIONA and 10 other fields | High correlation |
AT_ATIVIDA is highly correlated with ID_REGIONA and 9 other fields | High correlation |
CLASSI_FIN is highly correlated with ID_REGIONA and 13 other fields | High correlation |
PCRUZ is highly correlated with ID_REGIONA and 12 other fields | High correlation |
DT_INVEST has 244 (100.0%) missing values | Missing |
PMM has 190 (77.9%) missing values | Missing |
DT_ENCERRA has 244 (100.0%) missing values | Missing |
DT_INVEST is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
DT_ENCERRA is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
COPAISINF has 175 (71.7%) zeros | Zeros |
Reproduction
| Analysis started | 2021-07-06 18:52:03.424048 |
|---|---|
| Analysis finished | 2021-07-06 18:52:24.155716 |
| Duration | 20.73 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.9 KiB |
| 2 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 244 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 244 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 244 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 244 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 244 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 244 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 244 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.2 KiB |
| B54 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 732 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B54 |
|---|---|
| 2nd row | B54 |
| 3rd row | B54 |
| 4th row | B54 |
| 5th row | B54 |
Common Values
| Value | Count | Frequency (%) |
| B54 | 244 |
Length
Pie chart
| Value | Count | Frequency (%) |
| b54 | 244 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 244 | |
| 5 | 244 | |
| 4 | 244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 488 | |
| Uppercase Letter | 244 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 244 | |
| 4 | 244 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 488 | |
| Latin | 244 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 244 | |
| 4 | 244 |
Latin
| Value | Count | Frequency (%) |
| B | 244 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 732 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 244 | |
| 5 | 244 | |
| 4 | 244 |
DT_NOTIFIC
Date
| Distinct | 134 |
|---|---|
| Distinct (%) | 54.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 KiB |
| Minimum | 2015-01-02 00:00:00 |
|---|---|
| Maximum | 2015-12-26 00:00:00 |
| Distinct | 48 |
|---|---|
| Distinct (%) | 19.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201517.9016 |
| Minimum | 201453 |
|---|---|
| Maximum | 201551 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 201453 |
|---|---|
| 5-th percentile | 201503 |
| Q1 | 201509 |
| median | 201512 |
| Q3 | 201525.25 |
| 95-th percentile | 201548 |
| Maximum | 201551 |
| Range | 98 |
| Interquartile range (IQR) | 16.25 |
Descriptive statistics
| Standard deviation | 14.50665228 |
|---|---|
| Coefficient of variation (CV) | 7.198691609 × 10-5 |
| Kurtosis | 1.163243467 |
| Mean | 201517.9016 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.6077509865 |
| Sum | 49170368 |
| Variance | 210.4429603 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 201509 | 35 | 14.3% |
| 201510 | 18 | 7.4% |
| 201508 | 16 | 6.6% |
| 201514 | 13 | 5.3% |
| 201511 | 10 | 4.1% |
| 201507 | 10 | 4.1% |
| 201506 | 7 | 2.9% |
| 201545 | 6 | 2.5% |
| 201515 | 6 | 2.5% |
| 201513 | 6 | 2.5% |
| Other values (38) | 117 |
| Value | Count | Frequency (%) |
| 201453 | 1 | 0.4% |
| 201501 | 4 | 1.6% |
| 201502 | 4 | 1.6% |
| 201503 | 5 | 2.0% |
| 201504 | 4 | 1.6% |
| 201505 | 6 | 2.5% |
| 201506 | 7 | 2.9% |
| 201507 | 10 | 4.1% |
| 201508 | 16 | |
| 201509 | 35 |
| Value | Count | Frequency (%) |
| 201551 | 2 | 0.8% |
| 201550 | 6 | |
| 201549 | 4 | |
| 201548 | 2 | 0.8% |
| 201547 | 2 | 0.8% |
| 201546 | 3 | |
| 201545 | 6 | |
| 201544 | 1 | 0.4% |
| 201540 | 2 | 0.8% |
| 201539 | 4 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 KiB |
| 2015 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 976 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2015 | 244 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2015 | 244 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 244 | |
| 0 | 244 | |
| 1 | 244 | |
| 5 | 244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 976 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 244 | |
| 0 | 244 | |
| 1 | 244 | |
| 5 | 244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 976 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 244 | |
| 0 | 244 | |
| 1 | 244 | |
| 5 | 244 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 244 | |
| 0 | 244 | |
| 1 | 244 | |
| 5 | 244 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.2 KiB |
| 33 | |
|---|---|
| 53 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 488 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 33 |
|---|---|
| 2nd row | 33 |
| 3rd row | 33 |
| 4th row | 33 |
| 5th row | 33 |
Common Values
| Value | Count | Frequency (%) |
| 33 | 243 | |
| 53 | 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 33 | 243 | |
| 53 | 1 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 487 | |
| 5 | 1 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 488 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 487 | |
| 5 | 1 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 488 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 487 | |
| 5 | 1 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 488 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 487 | |
| 5 | 1 | 0.2% |
| Distinct | 18 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 331223.2049 |
| Minimum | 330010 |
|---|---|
| Maximum | 530010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 330010 |
|---|---|
| 5-th percentile | 330240 |
| Q1 | 330340 |
| median | 330455 |
| Q3 | 330455 |
| 95-th percentile | 330455 |
| Maximum | 530010 |
| Range | 200000 |
| Interquartile range (IQR) | 115 |
Descriptive statistics
| Standard deviation | 12778.74515 |
|---|---|
| Coefficient of variation (CV) | 0.03858046466 |
| Kurtosis | 243.9726141 |
| Mean | 331223.2049 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.61918918 |
| Sum | 80818462 |
| Variance | 163296327.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 330455 | 166 | |
| 330240 | 25 | 10.2% |
| 330340 | 24 | 9.8% |
| 330330 | 7 | 2.9% |
| 330250 | 3 | 1.2% |
| 330490 | 3 | 1.2% |
| 330170 | 3 | 1.2% |
| 330185 | 2 | 0.8% |
| 330610 | 2 | 0.8% |
| 330150 | 1 | 0.4% |
| Other values (8) | 8 | 3.3% |
| Value | Count | Frequency (%) |
| 330010 | 1 | 0.4% |
| 330040 | 1 | 0.4% |
| 330070 | 1 | 0.4% |
| 330150 | 1 | 0.4% |
| 330170 | 3 | 1.2% |
| 330185 | 2 | 0.8% |
| 330240 | 25 | |
| 330250 | 3 | 1.2% |
| 330330 | 7 | 2.9% |
| 330340 | 24 |
| Value | Count | Frequency (%) |
| 530010 | 1 | 0.4% |
| 330630 | 1 | 0.4% |
| 330610 | 2 | 0.8% |
| 330490 | 3 | 1.2% |
| 330455 | 166 | |
| 330452 | 1 | 0.4% |
| 330430 | 1 | 0.4% |
| 330350 | 1 | 0.4% |
| 330340 | 24 | 9.8% |
| 330330 | 7 | 2.9% |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 KiB |
Length
| Max length | 0 |
|---|---|
| Median length | 0 |
| Mean length | 0 |
| Min length | 0 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 244 |
Length
Pie chart
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
ID_UNIDADE
Real number (ℝ≥0)
| Distinct | 77 |
|---|---|
| Distinct (%) | 31.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2840699.361 |
| Minimum | 69 |
|---|---|
| Maximum | 7642415 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 69 |
|---|---|
| 5-th percentile | 2269783 |
| Q1 | 2276534 |
| median | 2288338 |
| Q3 | 3005992 |
| 95-th percentile | 5465428.35 |
| Maximum | 7642415 |
| Range | 7642346 |
| Interquartile range (IQR) | 729458 |
Descriptive statistics
| Standard deviation | 1345263.264 |
|---|---|
| Coefficient of variation (CV) | 0.47356763 |
| Kurtosis | 2.584384319 |
| Mean | 2840699.361 |
| Median Absolute Deviation (MAD) | 11804 |
| Skewness | 1.554737087 |
| Sum | 693130644 |
| Variance | 1.809733249 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2288338 | 87 | |
| 2276534 | 23 | 9.4% |
| 5462886 | 18 | 7.4% |
| 3005992 | 13 | 5.3% |
| 2271885 | 9 | 3.7% |
| 2272784 | 8 | 3.3% |
| 2269783 | 4 | 1.6% |
| 2273365 | 2 | 0.8% |
| 3607720 | 2 | 0.8% |
| 3211649 | 2 | 0.8% |
| Other values (67) | 76 |
| Value | Count | Frequency (%) |
| 69 | 1 | |
| 76 | 1 | |
| 12513 | 1 | |
| 12548 | 1 | |
| 12599 | 1 | |
| 12831 | 1 | |
| 26050 | 1 | |
| 2269546 | 1 | |
| 2269554 | 1 | |
| 2269651 | 2 |
| Value | Count | Frequency (%) |
| 7642415 | 1 | |
| 7458940 | 1 | |
| 6995462 | 1 | |
| 6938124 | 1 | |
| 6793231 | 1 | |
| 6753469 | 2 | |
| 6734014 | 1 | |
| 6427138 | 1 | |
| 6146376 | 1 | |
| 5476321 | 2 |
DT_SIN_PRI
Date
| Distinct | 153 |
|---|---|
| Distinct (%) | 62.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 KiB |
| Minimum | 2014-09-30 00:00:00 |
|---|---|
| Maximum | 2015-12-25 00:00:00 |
| Distinct | 52 |
|---|---|
| Distinct (%) | 21.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201515.3607 |
| Minimum | 201440 |
|---|---|
| Maximum | 201551 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 201440 |
|---|---|
| 5-th percentile | 201502 |
| Q1 | 201507 |
| median | 201510 |
| Q3 | 201524 |
| 95-th percentile | 201545 |
| Maximum | 201551 |
| Range | 111 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 16.53158373 |
|---|---|
| Coefficient of variation (CV) | 8.203634538 × 10-5 |
| Kurtosis | 3.969377108 |
| Mean | 201515.3607 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.5789124383 |
| Sum | 49169748 |
| Variance | 273.2932605 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 201508 | 23 | 9.4% |
| 201509 | 21 | 8.6% |
| 201505 | 17 | 7.0% |
| 201507 | 17 | 7.0% |
| 201506 | 11 | 4.5% |
| 201513 | 10 | 4.1% |
| 201503 | 8 | 3.3% |
| 201514 | 8 | 3.3% |
| 201510 | 7 | 2.9% |
| 201537 | 7 | 2.9% |
| Other values (42) | 115 |
| Value | Count | Frequency (%) |
| 201440 | 1 | 0.4% |
| 201451 | 1 | 0.4% |
| 201452 | 2 | 0.8% |
| 201453 | 1 | 0.4% |
| 201501 | 5 | 2.0% |
| 201502 | 6 | 2.5% |
| 201503 | 8 | |
| 201504 | 3 | 1.2% |
| 201505 | 17 | |
| 201506 | 11 |
| Value | Count | Frequency (%) |
| 201551 | 1 | 0.4% |
| 201550 | 2 | |
| 201549 | 4 | |
| 201548 | 1 | 0.4% |
| 201547 | 1 | 0.4% |
| 201546 | 3 | |
| 201545 | 4 | |
| 201544 | 4 | |
| 201543 | 1 | 0.4% |
| 201541 | 1 | 0.4% |
DT_NASC
Date
| Distinct | 233 |
|---|---|
| Distinct (%) | 95.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 KiB |
| Minimum | 1940-04-17 00:00:00 |
|---|---|
| Maximum | 2014-05-05 00:00:00 |
NU_IDADE_N
Real number (ℝ≥0)
| Distinct | 64 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4035.172131 |
| Minimum | 3010 |
|---|---|
| Maximum | 4074 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 3010 |
|---|---|
| 5-th percentile | 4010 |
| Q1 | 4028 |
| median | 4039 |
| Q3 | 4052 |
| 95-th percentile | 4066.85 |
| Maximum | 4074 |
| Range | 1064 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 67.88009189 |
|---|---|
| Coefficient of variation (CV) | 0.01682210565 |
| Kurtosis | 216.4219365 |
| Mean | 4035.172131 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -14.28346634 |
| Sum | 984582 |
| Variance | 4607.706874 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4028 | 12 | 4.9% |
| 4033 | 11 | 4.5% |
| 4040 | 9 | 3.7% |
| 4034 | 9 | 3.7% |
| 4057 | 8 | 3.3% |
| 4042 | 7 | 2.9% |
| 4031 | 7 | 2.9% |
| 4051 | 7 | 2.9% |
| 4052 | 7 | 2.9% |
| 4002 | 6 | 2.5% |
| Other values (54) | 161 |
| Value | Count | Frequency (%) |
| 3010 | 1 | 0.4% |
| 4002 | 6 | |
| 4004 | 1 | 0.4% |
| 4007 | 2 | 0.8% |
| 4008 | 2 | 0.8% |
| 4010 | 2 | 0.8% |
| 4013 | 2 | 0.8% |
| 4014 | 1 | 0.4% |
| 4015 | 1 | 0.4% |
| 4016 | 3 |
| Value | Count | Frequency (%) |
| 4074 | 2 | |
| 4072 | 3 | |
| 4071 | 2 | |
| 4069 | 1 | 0.4% |
| 4068 | 3 | |
| 4067 | 2 | |
| 4066 | 1 | 0.4% |
| 4065 | 4 | |
| 4064 | 1 | 0.4% |
| 4063 | 1 | 0.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.9 KiB |
| M | |
|---|---|
| F |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 244 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 173 | |
| F | 71 |
Length
Pie chart
| Value | Count | Frequency (%) |
| m | 173 | |
| f | 71 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 173 | |
| F | 71 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 244 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 173 | |
| F | 71 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 244 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 173 | |
| F | 71 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 173 | |
| F | 71 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.9 KiB |
| 6 | |
|---|---|
| 5 | |
| 9 | 6 |
| 1 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 244 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 5 |
| 3rd row | 6 |
| 4th row | 6 |
| 5th row | 6 |
Common Values
| Value | Count | Frequency (%) |
| 6 | 189 | |
| 5 | 48 | 19.7% |
| 9 | 6 | 2.5% |
| 1 | 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 6 | 189 | |
| 5 | 48 | 19.7% |
| 9 | 6 | 2.5% |
| 1 | 1 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 189 | |
| 5 | 48 | 19.7% |
| 9 | 6 | 2.5% |
| 1 | 1 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 244 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 189 | |
| 5 | 48 | 19.7% |
| 9 | 6 | 2.5% |
| 1 | 1 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 244 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 189 | |
| 5 | 48 | 19.7% |
| 9 | 6 | 2.5% |
| 1 | 1 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 189 | |
| 5 | 48 | 19.7% |
| 9 | 6 | 2.5% |
| 1 | 1 | 0.4% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 1 | |
|---|---|
| 9 | |
| 4 | |
| 2 | |
| 7 | |
| Other values (2) | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 0.9713114754 |
| Min length | 0 |
Characters and Unicode
| Total characters | 237 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 120 | |
| 9 | 71 | |
| 4 | 23 | 9.4% |
| 2 | 21 | 8.6% |
| 7 | 2.9% | |
| 5 | 1 | 0.4% |
| 3 | 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 120 | |
| 9 | 71 | |
| 4 | 23 | 9.7% |
| 2 | 21 | 8.9% |
| 5 | 1 | 0.4% |
| 3 | 1 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 120 | |
| 9 | 71 | |
| 4 | 23 | 9.7% |
| 2 | 21 | 8.9% |
| 3 | 1 | 0.4% |
| 5 | 1 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 237 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 120 | |
| 9 | 71 | |
| 4 | 23 | 9.7% |
| 2 | 21 | 8.9% |
| 3 | 1 | 0.4% |
| 5 | 1 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 237 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 120 | |
| 9 | 71 | |
| 4 | 23 | 9.7% |
| 2 | 21 | 8.9% |
| 3 | 1 | 0.4% |
| 5 | 1 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 237 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 120 | |
| 9 | 71 | |
| 4 | 23 | 9.7% |
| 2 | 21 | 8.9% |
| 3 | 1 | 0.4% |
| 5 | 1 | 0.4% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.2 KiB |
| 08 | |
|---|---|
| 09 | |
| 06 | |
| 07 | |
| Other values (7) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.893442623 |
| Min length | 0 |
Characters and Unicode
| Total characters | 462 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 08 |
| 3rd row | 08 |
| 4th row | 07 |
| 5th row | 08 |
Common Values
| Value | Count | Frequency (%) |
| 08 | 97 | |
| 09 | 58 | |
| 06 | 19 | 7.8% |
| 13 | 5.3% | |
| 07 | 11 | 4.5% |
| 05 | 11 | 4.5% |
| 10 | 8 | 3.3% |
| 03 | 8 | 3.3% |
| 01 | 7 | 2.9% |
| 02 | 5 | 2.0% |
| Other values (2) | 7 | 2.9% |
Length
| Value | Count | Frequency (%) |
| 08 | 97 | |
| 09 | 58 | |
| 06 | 19 | 8.2% |
| 07 | 11 | 4.8% |
| 05 | 11 | 4.8% |
| 10 | 8 | 3.5% |
| 03 | 8 | 3.5% |
| 01 | 7 | 3.0% |
| 02 | 5 | 2.2% |
| 04 | 4 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 234 | |
| 8 | 97 | |
| 9 | 58 | 12.6% |
| 6 | 19 | 4.1% |
| 1 | 15 | 3.2% |
| 7 | 11 | 2.4% |
| 5 | 11 | 2.4% |
| 3 | 8 | 1.7% |
| 2 | 5 | 1.1% |
| 4 | 4 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 462 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 234 | |
| 8 | 97 | |
| 9 | 58 | 12.6% |
| 6 | 19 | 4.1% |
| 1 | 15 | 3.2% |
| 7 | 11 | 2.4% |
| 5 | 11 | 2.4% |
| 3 | 8 | 1.7% |
| 2 | 5 | 1.1% |
| 4 | 4 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 462 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 234 | |
| 8 | 97 | |
| 9 | 58 | 12.6% |
| 6 | 19 | 4.1% |
| 1 | 15 | 3.2% |
| 7 | 11 | 2.4% |
| 5 | 11 | 2.4% |
| 3 | 8 | 1.7% |
| 2 | 5 | 1.1% |
| 4 | 4 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 462 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 234 | |
| 8 | 97 | |
| 9 | 58 | 12.6% |
| 6 | 19 | 4.1% |
| 1 | 15 | 3.2% |
| 7 | 11 | 2.4% |
| 5 | 11 | 2.4% |
| 3 | 8 | 1.7% |
| 2 | 5 | 1.1% |
| 4 | 4 | 0.9% |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.2 KiB |
| 33 |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 488 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 33 |
|---|---|
| 2nd row | 33 |
| 3rd row | 33 |
| 4th row | 33 |
| 5th row | 33 |
Common Values
| Value | Count | Frequency (%) |
| 33 | 244 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 33 | 244 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 488 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 488 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 488 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 488 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 488 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 488 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 488 |
| Distinct | 26 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 330376.1762 |
| Minimum | 330010 |
|---|---|
| Maximum | 330630 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 330010 |
|---|---|
| 5-th percentile | 330170 |
| Q1 | 330330 |
| median | 330455 |
| Q3 | 330455 |
| 95-th percentile | 330455 |
| Maximum | 330630 |
| Range | 620 |
| Interquartile range (IQR) | 125 |
Descriptive statistics
| Standard deviation | 116.4247832 |
|---|---|
| Coefficient of variation (CV) | 0.0003524006618 |
| Kurtosis | 0.4681075846 |
| Mean | 330376.1762 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.9495002607 |
| Sum | 80611787 |
| Variance | 13554.73013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 330455 | 126 | |
| 330340 | 27 | 11.1% |
| 330240 | 24 | 9.8% |
| 330330 | 20 | 8.2% |
| 330170 | 6 | 2.5% |
| 330190 | 5 | 2.0% |
| 330490 | 4 | 1.6% |
| 330350 | 3 | 1.2% |
| 330185 | 3 | 1.2% |
| 330250 | 3 | 1.2% |
| Other values (16) | 23 | 9.4% |
| Value | Count | Frequency (%) |
| 330010 | 1 | 0.4% |
| 330023 | 1 | 0.4% |
| 330040 | 3 | |
| 330070 | 2 | 0.8% |
| 330130 | 1 | 0.4% |
| 330150 | 3 | |
| 330170 | 6 | |
| 330185 | 3 | |
| 330190 | 5 | |
| 330200 | 1 | 0.4% |
| Value | Count | Frequency (%) |
| 330630 | 1 | 0.4% |
| 330610 | 2 | 0.8% |
| 330590 | 1 | 0.4% |
| 330580 | 1 | 0.4% |
| 330560 | 1 | 0.4% |
| 330490 | 4 | 1.6% |
| 330455 | 126 | |
| 330452 | 2 | 0.8% |
| 330430 | 1 | 0.4% |
| 330360 | 1 | 0.4% |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 KiB |
Length
| Max length | 0 |
|---|---|
| Median length | 0 |
| Mean length | 0 |
| Min length | 0 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 244 |
Length
Pie chart
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.9 KiB |
| 1 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 244 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 244 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 244 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 244 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 244 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 244 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 244 |
| Distinct | 98 |
|---|---|
| Distinct (%) | 40.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.0 KiB |
| 999991 | |
|---|---|
| 214205 | 12 |
| 999993 | 8 |
| 262105 | 4 |
| Other values (93) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 4.401639344 |
| Min length | 0 |
Characters and Unicode
| Total characters | 1074 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 69 ? |
|---|---|
| Unique (%) | 28.3% |
Sample
| 1st row | |
|---|---|
| 2nd row | 221105 |
| 3rd row | |
| 4th row | 782305 |
| 5th row | 262105 |
Common Values
| Value | Count | Frequency (%) |
| 65 | ||
| 999991 | 26 | 10.7% |
| 214205 | 12 | 4.9% |
| 999993 | 8 | 3.3% |
| 262105 | 4 | 1.6% |
| 241005 | 4 | 1.6% |
| 252105 | 4 | 1.6% |
| 221105 | 4 | 1.6% |
| 999992 | 3 | 1.2% |
| 715210 | 3 | 1.2% |
| Other values (88) | 111 |
Length
| Value | Count | Frequency (%) |
| 999991 | 26 | 14.5% |
| 214205 | 12 | 6.7% |
| 999993 | 8 | 4.5% |
| 241005 | 4 | 2.2% |
| 252105 | 4 | 2.2% |
| 221105 | 4 | 2.2% |
| 262105 | 4 | 2.2% |
| 223115 | 3 | 1.7% |
| 782305 | 3 | 1.7% |
| 715210 | 3 | 1.7% |
| Other values (87) | 108 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 9 | 199 | |
| 2 | 170 | |
| 5 | 152 | |
| 0 | 145 | |
| 3 | 77 | 7.2% |
| 4 | 66 | 6.1% |
| 7 | 29 | 2.7% |
| 6 | 26 | 2.4% |
| 8 | 10 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1074 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 9 | 199 | |
| 2 | 170 | |
| 5 | 152 | |
| 0 | 145 | |
| 3 | 77 | 7.2% |
| 4 | 66 | 6.1% |
| 7 | 29 | 2.7% |
| 6 | 26 | 2.4% |
| 8 | 10 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1074 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 9 | 199 | |
| 2 | 170 | |
| 5 | 152 | |
| 0 | 145 | |
| 3 | 77 | 7.2% |
| 4 | 66 | 6.1% |
| 7 | 29 | 2.7% |
| 6 | 26 | 2.4% |
| 8 | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1074 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 9 | 199 | |
| 2 | 170 | |
| 5 | 152 | |
| 0 | 145 | |
| 3 | 77 | 7.2% |
| 4 | 66 | 6.1% |
| 7 | 29 | 2.7% |
| 6 | 26 | 2.4% |
| 8 | 10 | 0.9% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 2 | |
|---|---|
| 1 | |
| 8 | 4 |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 0.9959016393 |
| Min length | 0 |
Characters and Unicode
| Total characters | 243 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 171 | |
| 1 | 68 | 27.9% |
| 8 | 4 | 1.6% |
| 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 171 | |
| 1 | 68 | 28.0% |
| 8 | 4 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 171 | |
| 1 | 68 | 28.0% |
| 8 | 4 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 243 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 171 | |
| 1 | 68 | 28.0% |
| 8 | 4 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 243 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 171 | |
| 1 | 68 | 28.0% |
| 8 | 4 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 243 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 171 | |
| 1 | 68 | 28.0% |
| 8 | 4 | 1.6% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 KiB |
| 11 | |
|---|---|
| 10 | |
| 4 | |
| 99 | |
| 3 | |
| Other values (5) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.659836066 |
| Min length | 0 |
Characters and Unicode
| Total characters | 405 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 11 |
|---|---|
| 2nd row | 10 |
| 3rd row | 4 |
| 4th row | 11 |
| 5th row | 10 |
Common Values
| Value | Count | Frequency (%) |
| 11 | 75 | |
| 10 | 72 | |
| 4 | 50 | |
| 99 | 17 | 7.0% |
| 3 | 9 | 3.7% |
| 9 | 6 | 2.5% |
| 6 | 2.5% | |
| 1 | 4 | 1.6% |
| 12 | 3 | 1.2% |
| 5 | 2 | 0.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 11 | 75 | |
| 10 | 72 | |
| 4 | 50 | |
| 99 | 17 | 7.1% |
| 3 | 9 | 3.8% |
| 9 | 6 | 2.5% |
| 1 | 4 | 1.7% |
| 12 | 3 | 1.3% |
| 5 | 2 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 229 | |
| 0 | 72 | 17.8% |
| 4 | 50 | 12.3% |
| 9 | 40 | 9.9% |
| 3 | 9 | 2.2% |
| 2 | 3 | 0.7% |
| 5 | 2 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 405 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 229 | |
| 0 | 72 | 17.8% |
| 4 | 50 | 12.3% |
| 9 | 40 | 9.9% |
| 3 | 9 | 2.2% |
| 2 | 3 | 0.7% |
| 5 | 2 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 405 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 229 | |
| 0 | 72 | 17.8% |
| 4 | 50 | 12.3% |
| 9 | 40 | 9.9% |
| 3 | 9 | 2.2% |
| 2 | 3 | 0.7% |
| 5 | 2 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 229 | |
| 0 | 72 | 17.8% |
| 4 | 50 | 12.3% |
| 9 | 40 | 9.9% |
| 3 | 9 | 2.2% |
| 2 | 3 | 0.7% |
| 5 | 2 | 0.5% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 1 | |
|---|---|
| 2 | |
| 6 | |
| 3 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 0.9754098361 |
| Min length | 0 |
Characters and Unicode
| Total characters | 238 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 2 | 69 | |
| 6 | 2.5% | |
| 3 | 3 | 1.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 2 | 69 | |
| 3 | 3 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 2 | 69 | |
| 3 | 3 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 238 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 2 | 69 | |
| 3 | 3 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 238 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 2 | 69 | |
| 3 | 3 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 2 | 69 | |
| 3 | 3 | 1.3% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 1 | |
|---|---|
| 2 | 15 |
| 6 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 0.9754098361 |
| Min length | 0 |
Characters and Unicode
| Total characters | 238 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 223 | |
| 2 | 15 | 6.1% |
| 6 | 2.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 223 | |
| 2 | 15 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 223 | |
| 2 | 15 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 238 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 223 | |
| 2 | 15 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 238 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 223 | |
| 2 | 15 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 223 | |
| 2 | 15 | 6.3% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.0 KiB |
| 2 | |
|---|---|
| 1 | 11 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 0 |
| Mean length | 0.2868852459 |
| Min length | 0 |
Characters and Unicode
| Total characters | 70 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | |
|---|---|
| 2nd row | 2 |
| 3rd row | |
| 4th row | 2 |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 174 | ||
| 2 | 58 | 23.8% |
| 1 | 11 | 4.5% |
| 3 | 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 58 | |
| 1 | 11 | 15.7% |
| 3 | 1 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 58 | |
| 1 | 11 | 15.7% |
| 3 | 1 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 70 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 58 | |
| 1 | 11 | 15.7% |
| 3 | 1 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 70 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 58 | |
| 1 | 11 | 15.7% |
| 3 | 1 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 58 | |
| 1 | 11 | 15.7% |
| 3 | 1 | 1.4% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.1 KiB |
| RJ | |
|---|---|
| AM | 4 |
| AP | 2 |
| SP | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 0 |
| Mean length | 0.3524590164 |
| Min length | 0 |
Characters and Unicode
| Total characters | 86 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | |
|---|---|
| 2nd row | AM |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 201 | ||
| RJ | 35 | 14.3% |
| AM | 4 | 1.6% |
| AP | 2 | 0.8% |
| SP | 1 | 0.4% |
| MA | 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| rj | 35 | |
| am | 4 | 9.3% |
| ap | 2 | 4.7% |
| ma | 1 | 2.3% |
| sp | 1 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 35 | |
| J | 35 | |
| A | 7 | 8.1% |
| M | 5 | 5.8% |
| P | 3 | 3.5% |
| S | 1 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 86 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 35 | |
| J | 35 | |
| A | 7 | 8.1% |
| M | 5 | 5.8% |
| P | 3 | 3.5% |
| S | 1 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 86 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 35 | |
| J | 35 | |
| A | 7 | 8.1% |
| M | 5 | 5.8% |
| P | 3 | 3.5% |
| S | 1 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 35 | |
| J | 35 | |
| A | 7 | 8.1% |
| M | 5 | 5.8% |
| P | 3 | 3.5% |
| S | 1 | 1.2% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.422131148 |
| Minimum | 0 |
|---|---|
| Maximum | 176 |
| Zeros | 175 |
| Zeros (%) | 71.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 31 |
| Maximum | 176 |
| Range | 176 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 24.50980492 |
|---|---|
| Coefficient of variation (CV) | 3.816459732 |
| Kurtosis | 26.45366574 |
| Mean | 6.422131148 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.050835183 |
| Sum | 1567 |
| Variance | 600.730537 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 175 | |
| 1 | 43 | 17.6% |
| 31 | 15 | 6.1% |
| 138 | 3 | 1.2% |
| 22 | 2 | 0.8% |
| 176 | 1 | 0.4% |
| 153 | 1 | 0.4% |
| 128 | 1 | 0.4% |
| 109 | 1 | 0.4% |
| 28 | 1 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 175 | |
| 1 | 43 | 17.6% |
| 7 | 1 | 0.4% |
| 22 | 2 | 0.8% |
| 28 | 1 | 0.4% |
| 31 | 15 | 6.1% |
| 109 | 1 | 0.4% |
| 128 | 1 | 0.4% |
| 138 | 3 | 1.2% |
| 153 | 1 | 0.4% |
| Value | Count | Frequency (%) |
| 176 | 1 | 0.4% |
| 153 | 1 | 0.4% |
| 138 | 3 | 1.2% |
| 128 | 1 | 0.4% |
| 109 | 1 | 0.4% |
| 31 | 15 | 6.1% |
| 28 | 1 | 0.4% |
| 22 | 2 | 0.8% |
| 7 | 1 | 0.4% |
| 1 | 43 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 KiB |
| 330340 | 14 |
|---|---|
| 330290 | 5 |
| 330240 | 4 |
| 330185 | 3 |
| Other values (11) | 16 |
Length
| Max length | 6 |
|---|---|
| Median length | 0 |
| Mean length | 1.032786885 |
| Min length | 0 |
Characters and Unicode
| Total characters | 252 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | |
|---|---|
| 2nd row | 130260 |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 202 | ||
| 330340 | 14 | 5.7% |
| 330290 | 5 | 2.0% |
| 330240 | 4 | 1.6% |
| 330185 | 3 | 1.2% |
| 330390 | 3 | 1.2% |
| 330250 | 2 | 0.8% |
| 160030 | 2 | 0.8% |
| 130120 | 2 | 0.8% |
| 130260 | 1 | 0.4% |
| Other values (6) | 6 | 2.5% |
Length
| Value | Count | Frequency (%) |
| 330340 | 14 | |
| 330290 | 5 | 11.9% |
| 330240 | 4 | 9.5% |
| 330185 | 3 | 7.1% |
| 330390 | 3 | 7.1% |
| 330250 | 2 | 4.8% |
| 160030 | 2 | 4.8% |
| 130120 | 2 | 4.8% |
| 130260 | 1 | 2.4% |
| 330580 | 1 | 2.4% |
| Other values (5) | 5 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 93 | |
| 0 | 84 | |
| 4 | 19 | 7.5% |
| 2 | 16 | 6.3% |
| 1 | 13 | 5.2% |
| 5 | 10 | 4.0% |
| 9 | 8 | 3.2% |
| 8 | 5 | 2.0% |
| 6 | 3 | 1.2% |
| 7 | 1 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 252 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 93 | |
| 0 | 84 | |
| 4 | 19 | 7.5% |
| 2 | 16 | 6.3% |
| 1 | 13 | 5.2% |
| 5 | 10 | 4.0% |
| 9 | 8 | 3.2% |
| 8 | 5 | 2.0% |
| 6 | 3 | 1.2% |
| 7 | 1 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 252 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 93 | |
| 0 | 84 | |
| 4 | 19 | 7.5% |
| 2 | 16 | 6.3% |
| 1 | 13 | 5.2% |
| 5 | 10 | 4.0% |
| 9 | 8 | 3.2% |
| 8 | 5 | 2.0% |
| 6 | 3 | 1.2% |
| 7 | 1 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 252 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 93 | |
| 0 | 84 | |
| 4 | 19 | 7.5% |
| 2 | 16 | 6.3% |
| 1 | 13 | 5.2% |
| 5 | 10 | 4.0% |
| 9 | 8 | 3.2% |
| 8 | 5 | 2.0% |
| 6 | 3 | 1.2% |
| 7 | 1 | 0.4% |
| Distinct | 17 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 KiB |
| LUAN | 9 |
|---|---|
| VALE | 7 |
| LUMI | 5 |
| SANA | 3 |
| Other values (12) | 17 |
Length
| Max length | 4 |
|---|---|
| Median length | 0 |
| Mean length | 0.6680327869 |
| Min length | 0 |
Characters and Unicode
| Total characters | 163 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | LUAN |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 203 | ||
| LUAN | 9 | 3.7% |
| VALE | 7 | 2.9% |
| LUMI | 5 | 2.0% |
| SANA | 3 | 1.2% |
| ANGO | 3 | 1.2% |
| MACA | 2 | 0.8% |
| MONT | 2 | 0.8% |
| BENF | 2 | 0.8% |
| SAO | 1 | 0.4% |
| Other values (7) | 7 | 2.9% |
Length
| Value | Count | Frequency (%) |
| luan | 9 | |
| vale | 7 | |
| lumi | 5 | |
| ango | 3 | 7.3% |
| sana | 3 | 7.3% |
| mont | 2 | 4.9% |
| maca | 2 | 4.9% |
| benf | 2 | 4.9% |
| serr | 1 | 2.4% |
| pedr | 1 | 2.4% |
| Other values (6) | 6 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 34 | |
| L | 21 | |
| N | 20 | |
| E | 15 | |
| U | 14 | |
| M | 12 | 7.4% |
| V | 8 | 4.9% |
| O | 7 | 4.3% |
| I | 5 | 3.1% |
| S | 5 | 3.1% |
| Other values (10) | 22 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 163 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 34 | |
| L | 21 | |
| N | 20 | |
| E | 15 | |
| U | 14 | |
| M | 12 | 7.4% |
| V | 8 | 4.9% |
| O | 7 | 4.3% |
| I | 5 | 3.1% |
| S | 5 | 3.1% |
| Other values (10) | 22 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 163 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 34 | |
| L | 21 | |
| N | 20 | |
| E | 15 | |
| U | 14 | |
| M | 12 | 7.4% |
| V | 8 | 4.9% |
| O | 7 | 4.3% |
| I | 5 | 3.1% |
| S | 5 | 3.1% |
| Other values (10) | 22 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 163 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 34 | |
| L | 21 | |
| N | 20 | |
| E | 15 | |
| U | 14 | |
| M | 12 | 7.4% |
| V | 8 | 4.9% |
| O | 7 | 4.3% |
| I | 5 | 3.1% |
| S | 5 | 3.1% |
| Other values (10) | 22 |
| Distinct | 128 |
|---|---|
| Distinct (%) | 52.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.1 KiB |
| 2015-03-02 | 12 |
|---|---|
| 2015-03-06 | 8 |
| 2015-02-27 | 8 |
| 2015-03-05 | 7 |
| None | 6 |
| Other values (123) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.852459016 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2404 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 72 ? |
|---|---|
| Unique (%) | 29.5% |
Sample
| 1st row | 2015-01-02 |
|---|---|
| 2nd row | 2015-01-06 |
| 3rd row | 2015-01-08 |
| 4th row | 2015-01-09 |
| 5th row | 2015-01-09 |
Common Values
| Value | Count | Frequency (%) |
| 2015-03-02 | 12 | 4.9% |
| 2015-03-06 | 8 | 3.3% |
| 2015-02-27 | 8 | 3.3% |
| 2015-03-05 | 7 | 2.9% |
| None | 6 | 2.5% |
| 2015-02-19 | 6 | 2.5% |
| 2015-03-30 | 5 | 2.0% |
| 2015-03-09 | 5 | 2.0% |
| 2015-02-20 | 4 | 1.6% |
| 2015-02-12 | 4 | 1.6% |
| Other values (118) | 179 |
Length
| Value | Count | Frequency (%) |
| 2015-03-02 | 12 | 4.9% |
| 2015-03-06 | 8 | 3.3% |
| 2015-02-27 | 8 | 3.3% |
| 2015-03-05 | 7 | 2.9% |
| none | 6 | 2.5% |
| 2015-02-19 | 6 | 2.5% |
| 2015-03-30 | 5 | 2.0% |
| 2015-03-09 | 5 | 2.0% |
| 2015-02-20 | 4 | 1.6% |
| 2015-02-12 | 4 | 1.6% |
| Other values (118) | 179 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 560 | |
| - | 476 | |
| 1 | 398 | |
| 2 | 386 | |
| 5 | 275 | |
| 3 | 102 | 4.2% |
| 9 | 51 | 2.1% |
| 4 | 44 | 1.8% |
| 6 | 39 | 1.6% |
| 7 | 26 | 1.1% |
| Other values (5) | 47 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1904 | |
| Dash Punctuation | 476 | 19.8% |
| Lowercase Letter | 18 | 0.7% |
| Uppercase Letter | 6 | 0.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 560 | |
| 1 | 398 | |
| 2 | 386 | |
| 5 | 275 | |
| 3 | 102 | 5.4% |
| 9 | 51 | 2.7% |
| 4 | 44 | 2.3% |
| 6 | 39 | 2.0% |
| 7 | 26 | 1.4% |
| 8 | 23 | 1.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| n | 6 | |
| e | 6 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 476 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2380 | |
| Latin | 24 | 1.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 560 | |
| - | 476 | |
| 1 | 398 | |
| 2 | 386 | |
| 5 | 275 | |
| 3 | 102 | 4.3% |
| 9 | 51 | 2.1% |
| 4 | 44 | 1.8% |
| 6 | 39 | 1.6% |
| 7 | 26 | 1.1% |
Latin
| Value | Count | Frequency (%) |
| N | 6 | |
| o | 6 | |
| n | 6 | |
| e | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 560 | |
| - | 476 | |
| 1 | 398 | |
| 2 | 386 | |
| 5 | 275 | |
| 3 | 102 | 4.2% |
| 9 | 51 | 2.1% |
| 4 | 44 | 1.8% |
| 6 | 39 | 1.6% |
| 7 | 26 | 1.1% |
| Other values (5) | 47 | 2.0% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 KiB |
| 1 | |
|---|---|
| 4 | |
| 2 | |
| 6 | |
| 8 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 0.9754098361 |
| Min length | 0 |
Characters and Unicode
| Total characters | 238 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 4 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 170 | |
| 4 | 44 | 18.0% |
| 2 | 22 | 9.0% |
| 6 | 2.5% | |
| 8 | 1 | 0.4% |
| 7 | 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 170 | |
| 4 | 44 | 18.5% |
| 2 | 22 | 9.2% |
| 8 | 1 | 0.4% |
| 7 | 1 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 170 | |
| 4 | 44 | 18.5% |
| 2 | 22 | 9.2% |
| 8 | 1 | 0.4% |
| 7 | 1 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 238 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 170 | |
| 4 | 44 | 18.5% |
| 2 | 22 | 9.2% |
| 8 | 1 | 0.4% |
| 7 | 1 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 238 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 170 | |
| 4 | 44 | 18.5% |
| 2 | 22 | 9.2% |
| 8 | 1 | 0.4% |
| 7 | 1 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 170 | |
| 4 | 44 | 18.5% |
| 2 | 22 | 9.2% |
| 8 | 1 | 0.4% |
| 7 | 1 | 0.4% |
| Distinct | 47 |
|---|---|
| Distinct (%) | 87.0% |
| Missing | 190 |
| Missing (%) | 77.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10146.27778 |
| Minimum | 1 |
|---|---|
| Maximum | 100001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 58.4 |
| Q1 | 231.25 |
| median | 548 |
| Q3 | 5520 |
| 95-th percentile | 82772.05 |
| Maximum | 100001 |
| Range | 100000 |
| Interquartile range (IQR) | 5288.75 |
Descriptive statistics
| Standard deviation | 24717.1395 |
|---|---|
| Coefficient of variation (CV) | 2.43607952 |
| Kurtosis | 8.181322948 |
| Mean | 10146.27778 |
| Median Absolute Deviation (MAD) | 460 |
| Skewness | 3.026917605 |
| Sum | 547899 |
| Variance | 610936985.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 301 | 3 | 1.2% |
| 64 | 3 | 1.2% |
| 200 | 2 | 0.8% |
| 112 | 2 | 0.8% |
| 480 | 2 | 0.8% |
| 800 | 1 | 0.4% |
| 310 | 1 | 0.4% |
| 320 | 1 | 0.4% |
| 810 | 1 | 0.4% |
| 576 | 1 | 0.4% |
| Other values (37) | 37 | 15.2% |
| (Missing) | 190 |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.4% |
| 16 | 1 | 0.4% |
| 48 | 1 | 0.4% |
| 64 | 3 | |
| 80 | 1 | 0.4% |
| 96 | 1 | 0.4% |
| 112 | 2 | |
| 128 | 1 | 0.4% |
| 200 | 2 | |
| 208 | 1 | 0.4% |
| Value | Count | Frequency (%) |
| 100001 | 1 | |
| 100000 | 1 | |
| 93863 | 1 | |
| 76800 | 1 | |
| 42320 | 1 | |
| 17800 | 1 | |
| 15720 | 1 | |
| 13160 | 1 | |
| 12400 | 1 | |
| 12000 | 1 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.0 KiB |
| 4 | |
|---|---|
| 3 | 14 |
| 1 | 13 |
| 5 | 13 |
| Other values (2) | 9 |
Length
| Max length | 1 |
|---|---|
| Median length | 0 |
| Mean length | 0.2786885246 |
| Min length | 0 |
Characters and Unicode
| Total characters | 68 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | 4 |
| 3rd row | |
| 4th row | 5 |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 176 | ||
| 4 | 19 | 7.8% |
| 3 | 14 | 5.7% |
| 1 | 13 | 5.3% |
| 5 | 13 | 5.3% |
| 2 | 5 | 2.0% |
| 6 | 4 | 1.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 4 | 19 | |
| 3 | 14 | |
| 1 | 13 | |
| 5 | 13 | |
| 2 | 5 | 7.4% |
| 6 | 4 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 19 | |
| 3 | 14 | |
| 5 | 13 | |
| 1 | 13 | |
| 2 | 5 | 7.4% |
| 6 | 4 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 19 | |
| 3 | 14 | |
| 5 | 13 | |
| 1 | 13 | |
| 2 | 5 | 7.4% |
| 6 | 4 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 68 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 19 | |
| 3 | 14 | |
| 5 | 13 | |
| 1 | 13 | |
| 2 | 5 | 7.4% |
| 6 | 4 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 19 | |
| 3 | 14 | |
| 5 | 13 | |
| 1 | 13 | |
| 2 | 5 | 7.4% |
| 6 | 4 | 5.9% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.1 KiB |
| 1 | |
|---|---|
| 99 | |
| 11 | 7 |
| 4 | 1 |
| Other values (2) | 2 |
Length
| Max length | 2 |
|---|---|
| Median length | 0 |
| Mean length | 0.3770491803 |
| Min length | 0 |
Characters and Unicode
| Total characters | 92 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | |
|---|---|
| 2nd row | 1 |
| 3rd row | |
| 4th row | 11 |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 178 | ||
| 1 | 37 | 15.2% |
| 99 | 19 | 7.8% |
| 11 | 7 | 2.9% |
| 4 | 1 | 0.4% |
| 2 | 1 | 0.4% |
| 3 | 1 | 0.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 37 | |
| 99 | 19 | |
| 11 | 7 | 10.6% |
| 4 | 1 | 1.5% |
| 2 | 1 | 1.5% |
| 3 | 1 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 51 | |
| 9 | 38 | |
| 3 | 1 | 1.1% |
| 4 | 1 | 1.1% |
| 2 | 1 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 92 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 51 | |
| 9 | 38 | |
| 3 | 1 | 1.1% |
| 4 | 1 | 1.1% |
| 2 | 1 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 92 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 51 | |
| 9 | 38 | |
| 3 | 1 | 1.1% |
| 4 | 1 | 1.1% |
| 2 | 1 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 51 | |
| 9 | 38 | |
| 3 | 1 | 1.1% |
| 4 | 1 | 1.1% |
| 2 | 1 | 1.1% |
| Distinct | 16 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.0 KiB |
| ARTESUNATO + MEFLOQUINA | 3 |
|---|---|
| ARTESUNATO+MEFLOQUINA | 2 |
| ARTESUNATO E MEFLOQUINA | 2 |
| ARTESU+MEFL+CLORIDRATO | 1 |
| Other values (11) | 11 |
Length
| Max length | 30 |
|---|---|
| Median length | 0 |
| Mean length | 1.823770492 |
| Min length | 0 |
Characters and Unicode
| Total characters | 445 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 4.9% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 225 | ||
| ARTESUNATO + MEFLOQUINA | 3 | 1.2% |
| ARTESUNATO+MEFLOQUINA | 2 | 0.8% |
| ARTESUNATO E MEFLOQUINA | 2 | 0.8% |
| ARTESU+MEFL+CLORIDRATO | 1 | 0.4% |
| ART+MEF+CLORIDRATO | 1 | 0.4% |
| ARTESUNARO + MEFLOQUINA | 1 | 0.4% |
| CLOROQUINA 3 E PRIMAQUINA 11 D | 1 | 0.4% |
| ARTESUNATO120+CLINDAMICINA45O | 1 | 0.4% |
| CLOROQUINA3DIAS PRIMAQUINA14DI | 1 | 0.4% |
| Other values (6) | 6 | 2.5% |
Length
| Value | Count | Frequency (%) |
| artesunato | 6 | |
| mefloquina | 6 | |
| 4 | 9.3% | |
| e | 4 | 9.3% |
| primaquina | 2 | 4.7% |
| artesunato+mefloquina | 2 | 4.7% |
| cloroq | 2 | 4.7% |
| mefloquina+artesunato | 1 | 2.3% |
| artesu+mefl+cloridrato | 1 | 2.3% |
| 3 | 1 | 2.3% |
| Other values (14) | 14 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 54 | |
| O | 38 | 8.5% |
| E | 31 | 7.0% |
| I | 31 | 7.0% |
| R | 30 | 6.7% |
| U | 30 | 6.7% |
| N | 30 | 6.7% |
| T | 28 | 6.3% |
| 24 | 5.4% | |
| M | 20 | 4.5% |
| Other values (14) | 129 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 389 | |
| Space Separator | 24 | 5.4% |
| Math Symbol | 17 | 3.8% |
| Decimal Number | 15 | 3.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 54 | |
| O | 38 | |
| E | 31 | |
| I | 31 | |
| R | 30 | 7.7% |
| U | 30 | 7.7% |
| N | 30 | 7.7% |
| T | 28 | 7.2% |
| M | 20 | 5.1% |
| Q | 20 | 5.1% |
| Other values (6) | 77 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 3 | 3 | |
| 0 | 2 | 13.3% |
| 4 | 2 | 13.3% |
| 2 | 1 | 6.7% |
| 5 | 1 | 6.7% |
Space Separator
| Value | Count | Frequency (%) |
| 24 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 389 | |
| Common | 56 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 54 | |
| O | 38 | |
| E | 31 | |
| I | 31 | |
| R | 30 | 7.7% |
| U | 30 | 7.7% |
| N | 30 | 7.7% |
| T | 28 | 7.2% |
| M | 20 | 5.1% |
| Q | 20 | 5.1% |
| Other values (6) | 77 |
Common
| Value | Count | Frequency (%) |
| 24 | ||
| + | 17 | |
| 1 | 6 | 10.7% |
| 3 | 3 | 5.4% |
| 0 | 2 | 3.6% |
| 4 | 2 | 3.6% |
| 2 | 1 | 1.8% |
| 5 | 1 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 445 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 54 | |
| O | 38 | 8.5% |
| E | 31 | 7.0% |
| I | 31 | 7.0% |
| R | 30 | 6.7% |
| U | 30 | 6.7% |
| N | 30 | 6.7% |
| T | 28 | 6.3% |
| 24 | 5.4% | |
| M | 20 | 4.5% |
| Other values (14) | 129 |
| Distinct | 51 |
|---|---|
| Distinct (%) | 20.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.1 KiB |
| None | |
|---|---|
| 2015-02-19 | 6 |
| 2015-03-02 | 4 |
| 2015-02-12 | 3 |
| 2015-05-05 | 3 |
| Other values (46) |
Length
| Max length | 10 |
|---|---|
| Median length | 4 |
| Mean length | 5.647540984 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1378 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 42 ? |
|---|---|
| Unique (%) | 17.2% |
Sample
| 1st row | None |
|---|---|
| 2nd row | 2015-01-06 |
| 3rd row | None |
| 4th row | 2015-01-09 |
| 5th row | None |
Common Values
| Value | Count | Frequency (%) |
| None | 177 | |
| 2015-02-19 | 6 | 2.5% |
| 2015-03-02 | 4 | 1.6% |
| 2015-02-12 | 3 | 1.2% |
| 2015-05-05 | 3 | 1.2% |
| 2015-03-05 | 3 | 1.2% |
| 2015-02-23 | 2 | 0.8% |
| 2015-01-12 | 2 | 0.8% |
| 2015-04-06 | 2 | 0.8% |
| 2015-04-07 | 1 | 0.4% |
| Other values (41) | 41 | 16.8% |
Length
| Value | Count | Frequency (%) |
| none | 177 | |
| 2015-02-19 | 6 | 2.5% |
| 2015-03-02 | 4 | 1.6% |
| 2015-02-12 | 3 | 1.2% |
| 2015-05-05 | 3 | 1.2% |
| 2015-03-05 | 3 | 1.2% |
| 2015-02-23 | 2 | 0.8% |
| 2015-01-12 | 2 | 0.8% |
| 2015-04-06 | 2 | 0.8% |
| 2015-04-07 | 1 | 0.4% |
| Other values (41) | 41 | 16.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 177 | |
| o | 177 | |
| n | 177 | |
| e | 177 | |
| 0 | 155 | |
| - | 134 | |
| 1 | 121 | |
| 2 | 109 | |
| 5 | 82 | |
| 3 | 21 | 1.5% |
| Other values (5) | 48 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 536 | |
| Lowercase Letter | 531 | |
| Uppercase Letter | 177 | 12.8% |
| Dash Punctuation | 134 | 9.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 155 | |
| 1 | 121 | |
| 2 | 109 | |
| 5 | 82 | |
| 3 | 21 | 3.9% |
| 9 | 15 | 2.8% |
| 4 | 12 | 2.2% |
| 6 | 10 | 1.9% |
| 8 | 6 | 1.1% |
| 7 | 5 | 0.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 177 | |
| n | 177 | |
| e | 177 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 177 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 134 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 708 | |
| Common | 670 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 155 | |
| - | 134 | |
| 1 | 121 | |
| 2 | 109 | |
| 5 | 82 | |
| 3 | 21 | 3.1% |
| 9 | 15 | 2.2% |
| 4 | 12 | 1.8% |
| 6 | 10 | 1.5% |
| 8 | 6 | 0.9% |
Latin
| Value | Count | Frequency (%) |
| N | 177 | |
| o | 177 | |
| n | 177 | |
| e | 177 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1378 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 177 | |
| o | 177 | |
| n | 177 | |
| e | 177 | |
| 0 | 155 | |
| - | 134 | |
| 1 | 121 | |
| 2 | 109 | |
| 5 | 82 | |
| 3 | 21 | 1.5% |
| Other values (5) | 48 | 3.5% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| TP_NOT | ID_AGRAVO | DT_NOTIFIC | SEM_NOT | NU_ANO | SG_UF_NOT | ID_MUNICIP | ID_REGIONA | ID_UNIDADE | DT_SIN_PRI | SEM_PRI | DT_NASC | NU_IDADE_N | CS_SEXO | CS_GESTANT | CS_RACA | CS_ESCOL_N | SG_UF | ID_MN_RESI | ID_RG_RESI | ID_PAIS | DT_INVEST | ID_OCUPA_N | CLASSI_FIN | AT_ATIVIDA | AT_LAMINA | AT_SINTOMA | TPAUTOCTO | COUFINF | COPAISINF | COMUNINF | LOC_INF | DEXAME | RESULT | PMM | PCRUZ | TRA_ESQUEM | DSTRAESQUE | DTRATA | DT_ENCERRA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | B54 | 2015-01-02 | 201453 | 2015 | 33 | 330455 | 5476321 | 2014-12-31 | 201453 | 2012-07-16 | 4002 | M | 6 | 4 | 10 | 33 | 330455 | 1 | NaT | 2 | 11 | 1 | 1 | 0 | 2015-01-02 | 1 | NaN | None | NaT | ||||||||||
| 1 | 2 | B54 | 2015-01-06 | 201501 | 2015 | 33 | 330455 | 2288338 | 2014-12-22 | 201452 | 1979-05-11 | 4035 | F | 5 | 1 | 08 | 33 | 330070 | 1 | NaT | 221105 | 1 | 10 | 2 | 1 | 2 | AM | 1 | 130260 | 2015-01-06 | 4 | 5040.0 | 4 | 1 | 2015-01-06 | NaT | ||||
| 2 | 2 | B54 | 2015-01-08 | 201501 | 2015 | 33 | 330610 | 3211649 | 2014-12-20 | 201451 | 1947-07-18 | 4067 | M | 6 | 1 | 08 | 33 | 330610 | 1 | NaT | 2 | 4 | 1 | 1 | 0 | 2015-01-08 | 1 | NaN | None | NaT | ||||||||||
| 3 | 2 | B54 | 2015-01-09 | 201501 | 2015 | 33 | 330455 | 2273365 | 2015-01-05 | 201501 | 1976-07-17 | 4038 | M | 6 | 2 | 07 | 33 | 330455 | 1 | NaT | 782305 | 1 | 11 | 2 | 1 | 2 | 31 | LUAN | 2015-01-09 | 2 | 11307.0 | 5 | 11 | 2015-01-09 | NaT | |||||
| 4 | 2 | B54 | 2015-01-09 | 201501 | 2015 | 33 | 330455 | 3005992 | 2015-01-08 | 201501 | 1984-02-21 | 4030 | M | 6 | 4 | 08 | 33 | 330455 | 1 | NaT | 262105 | 2 | 10 | 2 | 1 | 0 | 2015-01-09 | 1 | NaN | None | NaT | |||||||||
| 5 | 2 | B54 | 2015-01-12 | 201502 | 2015 | 33 | 330455 | 2288338 | 2015-01-05 | 201501 | 1974-04-27 | 4040 | M | 6 | 9 | 33 | 330330 | 1 | NaT | 1 | 11 | 3 | 1 | 2 | 128 | 2015-01-12 | 2 | 16.0 | 1 | 99 | ARTESUNARO + MEFLOQUINA | 2015-01-12 | NaT | |||||||
| 6 | 2 | B54 | 2015-01-12 | 201502 | 2015 | 33 | 330455 | 2270609 | 2015-01-09 | 201501 | 1974-04-20 | 4040 | F | 5 | 9 | 09 | 33 | 330455 | 1 | NaT | 516125 | 1 | 10 | 2 | 1 | 2 | 31 | LUAN | 2015-01-12 | 2 | 467.0 | 3 | 99 | MEFLOQUINA+ARTESUNATO | 2015-01-12 | NaT | ||||
| 7 | 2 | B54 | 2015-01-12 | 201502 | 2015 | 33 | 330610 | 3211649 | 2015-01-10 | 201501 | 1949-07-18 | 4065 | M | 6 | 1 | 08 | 33 | 330610 | 1 | NaT | 141405 | 2 | 11 | 2 | 1 | 0 | 2015-01-12 | 1 | NaN | None | NaT | |||||||||
| 8 | 2 | B54 | 2015-01-13 | 201502 | 2015 | 33 | 330455 | 5462886 | 2015-01-11 | 201502 | 1962-04-19 | 4052 | F | 6 | 4 | 06 | 33 | 330455 | 1 | NaT | 2 | 10 | 1 | 1 | 0 | 2015-01-13 | 1 | NaN | None | NaT | ||||||||||
| 9 | 2 | B54 | 2015-01-19 | 201503 | 2015 | 33 | 330455 | 2288338 | 2015-01-15 | 201502 | 1981-08-05 | 4033 | M | 6 | 1 | 08 | 33 | 330630 | 1 | NaT | 1 | 10 | 2 | 1 | 2 | 22 | 2015-01-19 | 2 | 2320.0 | 4 | 11 | 2015-01-19 | NaT |
Last rows
| TP_NOT | ID_AGRAVO | DT_NOTIFIC | SEM_NOT | NU_ANO | SG_UF_NOT | ID_MUNICIP | ID_REGIONA | ID_UNIDADE | DT_SIN_PRI | SEM_PRI | DT_NASC | NU_IDADE_N | CS_SEXO | CS_GESTANT | CS_RACA | CS_ESCOL_N | SG_UF | ID_MN_RESI | ID_RG_RESI | ID_PAIS | DT_INVEST | ID_OCUPA_N | CLASSI_FIN | AT_ATIVIDA | AT_LAMINA | AT_SINTOMA | TPAUTOCTO | COUFINF | COPAISINF | COMUNINF | LOC_INF | DEXAME | RESULT | PMM | PCRUZ | TRA_ESQUEM | DSTRAESQUE | DTRATA | DT_ENCERRA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 234 | 2 | B54 | 2015-12-11 | 201549 | 2015 | 33 | 330455 | 2288338 | 2015-12-08 | 201549 | 1978-06-24 | 4037 | M | 6 | 1 | 08 | 33 | 330455 | 1 | NaT | 253105 | 1 | 10 | 1 | 1 | 2 | 28 | 2015-12-11 | 2 | NaN | 5 | 99 | ARTESUNATO | 2015-12-11 | NaT | |||||
| 235 | 2 | B54 | 2015-12-11 | 201549 | 2015 | 33 | 330240 | 2276534 | 2015-12-08 | 201549 | 1958-01-13 | 4057 | M | 6 | 1 | 09 | 33 | 330240 | 1 | NaT | 2 | 11 | 1 | 2 | 0 | 2015-12-11 | 1 | NaN | None | NaT | ||||||||||
| 236 | 2 | B54 | 2015-12-14 | 201550 | 2015 | 33 | 330455 | 5462886 | 2015-12-11 | 201549 | 1964-02-26 | 4051 | F | 6 | 1 | 09 | 33 | 330455 | 1 | NaT | 354145 | 2 | 10 | 2 | 1 | 0 | 2015-12-14 | 1 | NaN | None | NaT | |||||||||
| 237 | 2 | B54 | 2015-12-14 | 201550 | 2015 | 33 | 330455 | 2288338 | 2015-12-04 | 201548 | 1980-02-23 | 4035 | M | 6 | 1 | 08 | 33 | 330330 | 1 | NaT | 521110 | 2 | 10 | 1 | 1 | 0 | 2015-12-14 | 1 | NaN | None | NaT | |||||||||
| 238 | 2 | B54 | 2015-12-15 | 201550 | 2015 | 33 | 330455 | 2288338 | 2015-11-12 | 201545 | 1943-07-17 | 4072 | M | 6 | 1 | 06 | 33 | 330455 | 1 | NaT | 2 | 11 | 1 | 1 | 0 | 2015-12-15 | 1 | NaN | None | NaT | ||||||||||
| 239 | 2 | B54 | 2015-12-16 | 201550 | 2015 | 33 | 330455 | 2288338 | 2015-12-12 | 201549 | 1943-07-15 | 4072 | M | 6 | 9 | 08 | 33 | 330455 | 1 | NaT | 214405 | 2 | 10 | 1 | 1 | 0 | 2015-12-16 | 1 | NaN | None | NaT | |||||||||
| 240 | 2 | B54 | 2015-12-17 | 201550 | 2015 | 33 | 330455 | 5462886 | 2015-12-16 | 201550 | 1958-05-08 | 4057 | M | 6 | 1 | 07 | 33 | 330455 | 1 | NaT | 2 | 10 | 1 | 1 | 0 | 2015-12-17 | 1 | NaN | None | NaT | ||||||||||
| 241 | 2 | B54 | 2015-12-18 | 201550 | 2015 | 33 | 330455 | 2288338 | 2015-09-17 | 201537 | 1987-07-31 | 4028 | M | 6 | 9 | 06 | 33 | 330455 | 1 | NaT | 711405 | 1 | 5 | 3 | 1 | 2 | 138 | 2015-12-18 | 4 | 5680.0 | 4 | 1 | 2015-12-18 | NaT | ||||||
| 242 | 2 | B54 | 2015-12-21 | 201551 | 2015 | 33 | 330455 | 2288338 | 2015-12-14 | 201550 | 1962-09-03 | 4053 | M | 6 | 9 | 09 | 33 | 330455 | 1 | NaT | 2 | 10 | 1 | 1 | 0 | 2015-12-21 | 1 | NaN | None | NaT | ||||||||||
| 243 | 2 | B54 | 2015-12-26 | 201551 | 2015 | 33 | 330455 | 2269546 | 2015-12-25 | 201551 | 1969-07-11 | 4046 | M | 6 | 9 | 09 | 33 | 330455 | 1 | NaT | 998999 | 2 | 9 | 1 | 1 | 0 | 2015-12-26 | 1 | NaN | None | NaT |